Implementation of the essential parts of the SPIRE speech research
facility has been completed. Speech signals can now be digitized,
analyzed, and displayed conveniently. We have digitized approximately
thirty minutes of speech spoken by seven different speakers.
Phonetic transcriptions for some of these words and sentences have
also been entered. In addition, SPIREX, a data base facility for
examining the statistical properties of speech sounds, has been
implemented on the Lisp machine.

A study of the structural constraints on the sound patterns of
American English has also been completed. With the help of an
on-line dictionary, we were able to determine the various ways
in which words can be accessed without detailed phonetic
information. The 20,000-word Merriam-Webster Pocket Dictionary
has been carefully scrutinized in terms of its phonemic spellings
and stress markers.

In order to design the corpus to be used to study the acoustic
cues to word boundaries, the Harvard phonetically balanced
sentences were carefully examined. Our initial conclusion is
that the distribution of consonants within words and across word
boundaries is sufficiently rich that the Harvard sentences can
be used as the initial corpus. Data for the pilot study were
re-examined using the SPIREX facility for confirmation.

A preliminary version of the “front-end” portion of the large-vocabulary
isolated-word recognition system has been implemented.

The system can now obtain a set of acoustic parameters from the
digitized waveform. From these acoustic parameters, the speech
signal is segmented into acoustic regions. A set of re-write
rules is then applied to transform the acoustic segments into a
string (or lattice) of broad phonetic labels. This string is
used to access a 20,000-word lexicon, and the result is a set
of words satisfying the broad phonetic description. So far we
have tested approximately 100 words spoken by two male speakers,
and the results are promising. Very often, this procedure
reduces the set of lexical candidates significantly (down to <1%
of the lexicon).
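
The lexical access step described above can be illustrated with a
minimal sketch. It assumes a hypothetical set of five broad phonetic
classes and a five-word toy lexicon; neither is taken from the actual
system, whose label set and lexicon format are not given here. Each
word is indexed by the sequence of broad classes of its phonemic
spelling, and a broad-class string then retrieves the matching words.

```python
from collections import defaultdict

# Hypothetical broad phonetic classes (an assumption for illustration;
# the report does not list the label set it uses).
BROAD_CLASS = {
    "p": "STOP", "b": "STOP", "t": "STOP", "d": "STOP", "k": "STOP", "g": "STOP",
    "s": "FRIC", "z": "FRIC", "f": "FRIC", "v": "FRIC",
    "m": "NASAL", "n": "NASAL",
    "l": "GLIDE", "r": "GLIDE", "w": "GLIDE", "y": "GLIDE",
    "ae": "VOWEL", "ih": "VOWEL", "iy": "VOWEL", "eh": "VOWEL", "ah": "VOWEL",
}

# Toy lexicon with made-up phonemic spellings, standing in for the
# 20,000-word dictionary.
TOY_LEXICON = {
    "pat": ["p", "ae", "t"],
    "bid": ["b", "ih", "d"],
    "sit": ["s", "ih", "t"],
    "man": ["m", "ae", "n"],
    "split": ["s", "p", "l", "ih", "t"],
}


def broad_pattern(phonemes):
    """Map a phonemic spelling to its sequence of broad class labels."""
    return tuple(BROAD_CLASS[p] for p in phonemes)


def build_index(lexicon):
    """Group words by broad-class pattern."""
    index = defaultdict(set)
    for word, phonemes in lexicon.items():
        index[broad_pattern(phonemes)].add(word)
    return index


def lookup(index, pattern):
    """Return the words whose broad-class pattern equals `pattern`."""
    return sorted(index.get(tuple(pattern), ()))


index = build_index(TOY_LEXICON)
candidates = lookup(index, ["STOP", "VOWEL", "STOP"])
```

In this toy case the pattern STOP VOWEL STOP retrieves two of the five
words, while a longer pattern such as FRIC STOP GLIDE VOWEL STOP
retrieves only one. A lattice of labels could be handled by looking up
each path through the lattice and taking the union of the results.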

A preliminary version of the continuous digit recognition system
has been implemented. The system is intended to utilize
acoustic-phonetic knowledge, and this knowledge is implemented in a
modular fashion so that it can easily be extended to other tasks. The
current implementation produces a broad phonetic description
of the signal, which can then be used to propose the underlying
digit strings. Because of the strong constraints on the task, we
expect that this initial broad classification will very often produce
a unique digit string.
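
As a rough illustration of why broad classes can suffice for digits,
the sketch below enumerates every digit string whose concatenated
broad-class patterns match a given label sequence. The class labels
and the per-digit patterns are assumptions made for this example
(idealized citation forms, with no coarticulation or merging of
adjacent segments), not the rules used in the actual system.

```python
# Hypothetical broad-class patterns for the digits, derived from
# idealized citation-form pronunciations (an assumption for illustration).
DIGIT_PATTERNS = {
    "zero": ("FRIC", "VOWEL", "GLIDE", "VOWEL"),
    "one": ("GLIDE", "VOWEL", "NASAL"),
    "two": ("STOP", "VOWEL"),
    "three": ("FRIC", "GLIDE", "VOWEL"),
    "four": ("FRIC", "VOWEL", "GLIDE"),
    "five": ("FRIC", "VOWEL", "FRIC"),
    "six": ("FRIC", "VOWEL", "STOP", "FRIC"),
    "seven": ("FRIC", "VOWEL", "FRIC", "VOWEL", "NASAL"),
    "eight": ("VOWEL", "STOP"),
    "nine": ("NASAL", "VOWEL", "NASAL"),
}


def parse_digits(labels):
    """Return all digit strings whose patterns concatenate to `labels`.

    An empty label sequence has exactly one parse, the empty string.
    """
    labels = tuple(labels)
    if not labels:
        return [[]]
    results = []
    for digit, pattern in DIGIT_PATTERNS.items():
        n = len(pattern)
        if labels[:n] == pattern:
            # Recurse on the remaining labels after consuming this digit.
            for rest in parse_digits(labels[n:]):
                results.append([digit] + rest)
    return results


hypotheses = parse_digits(["STOP", "VOWEL", "FRIC", "VOWEL", "FRIC"])
```

Under these assumptions the label sequence above admits a single
parse, "two five". Real continuous speech would require tolerance for
segmentation errors and for segments that merge across digit
boundaries, which this sketch does not attempt.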

A variety of system software has been written on the Lisp
machine. This includes device drivers for the Digital Sound
A/D-D/A converter and the Versatec Printer Plotter, and a
critical-band digital filter bank for spectrum analysis.